3574 results found.
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
License:
Size:
None Production Status:
Existing-used
Use:
Laughter Detection
-
Paper title:Robust Laughter Detection in Noisy Environments
-
Paper track:3.7 Perception of paralinguistic phenomena/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jon Gillick | Switchboard Corpus | /N |
Documentation:
None
Speech
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
260 hoursProduction Status:
Existing-used
Use:
Parsing and Tagging
-
Paper title:Assessing the Use of Prosody in Constituency Parsing of Imperfect Transcripts
-
Paper track:2.10 Other topics in Phonetics, Phonology, and Pro/Poster Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Trang Tran | The Switchboard-1 Telephone Speech Corpus | /N |
Documentation:
None
Not Applicable
Treebank,
Language Type:
Monolingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
100000 sentencesProduction Status:
Existing-used
Use:
Parsing and Tagging
-
Paper title:Assessing the Use of Prosody in Constituency Parsing of Imperfect Transcripts
-
Paper track:2.10 Other topics in Phonetics, Phonology, and Pro/Poster Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Trang Tran | Penn TreeBank 3, Switchboard corpus part | /N |
Documentation:
None
Coughs
Cough Sample Database,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons
Size:
951 MByteProduction Status:
Existing-used
Use:
Machine Learning
-
Paper title:A Multi-Branch Deep Learning Network for Automated Detection of COVID-19
-
Paper track:5.12 Other topics in Analysis of Speech and Audio /Poster Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Gunvant Chaudhari | COUGHVID crowdsourcing dataset | /N |
Documentation:
Yes there is documentation in associated English paper.
Multimodal/Multimedia
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
1.5 hoursProduction Status:
Existing-used
Use:
Acquisition
-
Paper title:Weakly-supervised word-level pronunciation error detection in non-native English speech
-
Paper track:1.10 Bilingual and L2 acquisition and processing/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Daniel Korzekwa | GUT Isle L2 Corpus of Polish Speakers | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
License:
CreativeCommons
Size:
30,043 sentencesProduction Status:
Existing-used
Use:
Spoken Language Understanding
-
Paper title:Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification
-
Paper track:11.10 Systems for spoken language understanding/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | YIDI JIANG | Fluent Speech Commands: A dataset for spoken language understanding research | /N |
Documentation:
https://fluent.ai/fluent-speech-commands-a-dataset-for-spoken-language-understanding-research/
Speech/Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
License:
LDC
Size:
5833 sentencesProduction Status:
Existing-used
Use:
Spoken Language Understanding
-
Paper title:Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification
-
Paper track:11.10 Systems for spoken language understanding/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | YIDI JIANG | Air Travel Information Services | /N |
Documentation:
ATIS-2, ATIS-3
Not Applicable
Software Toolkit,
Language Type:
Multilingual
Languages:
English MATLAB Python
Availability:
Freely Available
License:
Open Source
Size:
a few MByteProduction Status:
Newly created-in progress
Use:
Evaluation/Validation
-
Paper title:Out of a hundred trials, how many errors does your speaker verifier make?
-
Paper track:4.6 Evaluation of speaker and language identificat/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Niko Brummer | PYLLR Toolkit | /N |
Documentation:
Readme, in-code documentation and an arXiv paper.
Speech
Corpus,
Language Type:
Multilingual
Languages:
English Finnish German Mandarin Chinese
Availability:
Freely Available
License:
OpenSource
Size:
None Production Status:
Existing-used
Use:
Speech Synthesis
-
Paper title:Cross-lingual Voice Conversion with Disentangled Universal Linguistic Representations
-
Paper track:7.11 Cross-lingual and multilingual aspects in spe/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Zhenchuan Yang | VCC2020 | /N |
Documentation:
http://www.vc-challenge.org/
Speech/Written
Corpus,
Language Type:
Multilingual
Languages:
Dutch English French German Italian Polish Portuguese Spanish
Availability:
Freely Available
License:
CC BY 4.0
Size:
None Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
-
Paper title:LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech
-
Paper track:8.1 Feature extraction and low-level feature model/Oral Presentation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Laurent Besacier | Multilingual LibriSpeech (MLS) | /N |
Documentation:
https://arxiv.org/abs/2012.03411, English, public




